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Abstract 

Suppose that we intend to perform an experiment consisting of a set of independent trials. 
The mean value of the response of each trial is assumed to be equal to the sum of the effect of 
the treatment selected for the trial, and some nuisance effects, e.g., the effect of a time trend, 
or blocking. In this model, we examine optimal approximate designs for the estimation of a 
system of treatment contrasts, with respect to a wide range of optimality criteria. 

We show that it is necessary for any optimal design to attain the optimal treatment 
proportions, which may be obtained from the marginal model that excludes the nuisance 
effects. Moreover, we prove that for a design to be optimal, it is sufficient that it attains 
the optimal treatment proportions and satisfies conditions of resistance to nuisance effects. 
For selected natural choices of treatment contrasts and optimality criteria, we calculate the 
optimal treatment proportions and give an explicit form of optimal designs. In particular, we 
obtain optimal treatment proportions for comparison of a set of new treatments with a set 
of controls. The results allow us to construct a method of calculating optimal approximate 
designs with a small support by means of linear programming. As a consequence, we can 
construct efficient exact designs by a simple heuristic. 
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1 Introduction 


The results of an experiment may be affected by conditions with effects that we aim to 
estimate and by other conditions with nuisance effects. For example, any experiment that 
consists of multiple trials performed in a time sequence may be subject to a nuisance time 
trend caused by the ageing of the material used for the experiment, wearing down of the 
experimental devices, changes in the temperature, etc. Many agricultural experiments are 
subject to a two-dimensional nuis ance trend, re s ulting fro m the arrangement of the tr ials in 
a two-dimensional field, see, e.g.. IJacroux et al.l 1997l | and iBailev and WilliamsI 2007|. The 


objective of the experimental design in such cases is to eliminate the nuisance effects, or to 
provide as much information as possible on the effects of interest. 


The aim of this paper is to provide the <i>-optimal approximate designs for estimating a 
system of contrasts of treatment effects under the presence of nuisance effects, where $ is a 
given optimality criterion. Particularly, we aim to provide designs optimal under the presence 
of nuisance time trends. 

There is a large amount of literature on exact designs in such models, especially on block 


designs le.g.. lMaiumdar and Notz 


Cox 




19831 iMaiumdar 19961. 


Cheng 


199 


1 


■Tacroiix 


Bradley and Yeh 


Atkinson and Donev 

198n| 


1002 [) , but also on trend 


Jacroux et al 


19961) or b lock designs 


1997 1). However, 


resistant designs (e.g., 
in the presence of a trend (e.g., 
these results are usually tailored for a particular model and a system of contrasts, often with 
limiting assumptions (e.g., the relationship between the number of blocks and treatments, 
or tre nds modelled by polynomials of given degrees). In approxima te theory, 


19^1 studied optimal block designs for estimating centered contrasts, 


Pukelsheim 


Giovagnoli and Wvnn 


1985l | obtained optimal block de signs for compa ring treatments with a control with respect 
to Kiefer’s optimality criteria and 


Schwabe 


1996j studied product designs in additive models. 

The results on approximate designs are usually simpler and more general than the results 
on exact designs, therefore they provide a valuable insight into the qualitative aspects of the 
design problem. Moreover, the optimal approximate designs facilitate the computation of 
informative lower bounds on the efficiency of exact designs. However, it is not always clear 
how to convert approximate designs to exact designs that can be used for an actual, finite-size 
experiment. In this paper, we provide conditions of approximate optimality of designs for the 
estimation of any system of contrasts in a general additive model with any system of nuisance 
effects. Moreover, we demonstrate that the conditions can be employed to construct efficient 
exact designs by means of linear programming. 
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We show that a ^-optimal approximate design may be obtained in two steps: (i) Cal¬ 
culate ^-optimal proportions of treatment replications (treatment weights). These optimal 
proportions depend on the choice of contrasts of interest and on the optimality criterion <i>; 
however, they do not depend on the nuisance effects, (ii) Subject to keeping the optimal 
proportions of treatment replications, distribute the treatments to nuisance conditions such 
that the resulting design is resistant to nuisance effects. The designs resistant to nuisance 
effects are an exten s ion o f the designs orthogonal t o the time trend (balanced for trend, or 
trend-free, cf. 


Cox 


195l| . IJacroux and Ravi 1990|) to a more general class of models and 


treatment contrasts. 


Schwabe 


Schwabe 

1996 

and 

Kunert 


198 


i. 


informat i on is r etained in a finer model was used, e.g., in 

1996| studied optimal product designs, un like the prese nt paper, where optimal 


Kunerf 


198:i st udied exact de signs in 


Kiefer 


1975l |. means 


designs with non-product structure are provided too; 
the case of universal optimality. Universal optimality, formulated by 
optimality for estimating a maximal system of orthonormal contrasts (which is a special case 
of the general system of contrasts that we consider), with respect to a wide range of criteria. 

For selected systems of treatment contrasts and a wide class of optimality criteria $, we 
calculate ^-optimal treatment weights and thus obtain a class of ^-optimal designs. For 
instance, for the estimation of contrasts for comparing a set of new treatments with a set 
of controls, we provide MU-optimal designs and optimal designs with respect to Kiefer’s 
$p-optimality criteria, p G [—oo,0], including A- and K- optimality (p = —I and p = —oo. 


Giovagnoli and Wvnn 


19851 . who 


respectively). These results generalize the results given by 
obtained <i>p-optimal block designs for comparing treatments with one control. For any com¬ 
pletely symmetric system of contrasts, we show that the uniform design is <i>-optimal for all 
ort hogonally invarian t information functions, which generalizes, for instance, a result given 
by 


Pukelsheim 


1983 1. 


The obtained results may be used to analytically construct optimal approximate designs. 
A special case of <i)-optimal designs resistan t to nuisance effe cts are the product designs with 
^-optimal treatment proportions (cf., e.g.. 


Schwabe 


1996|), but the approximate product 


designs have a large support, which makes the transition to exact designs difficult. However, 
the set of optimal approximate designs is typically large and both the conditions of optimal 
treatment weights and the conditions of resistance to nuisance effects are linear. Therefore, 
we can employ the simplex method of linear programming to obtain optimal approximate 
designs with a small support. This allows us to construct efficient exact designs using a 
simple heuristic, especially in the presence of nuisance trends. 
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In the rest of Section [TJ our notation and the statistical model is established. The main 
theoretical results are proved in Section [2l In the same section, we provide optimal treatment 
proportions for estimating particular sets of contrasts. Examples of experiments under the 
presence of nuisance effects are provided in Section |31 The theoretical results are applied in 
Section|3]to obtain optimal approximate designs with small support and consequently efficient 
exact designs. 

1.1 Notation 

The symbols and 0„ denote the column vectors of length n of ones and zeroes, respectively. 
The symbol Jn denotes the n x n matrix J„ = 1^1^ of ones and is the u-th standard unit 
vector (the u-th column of the identity matrix where n is the dimension of e„). By the 
symbol Omxn, or by 0 if the dimensions are clear from the context, we denote the m x n 
matrix of zeroes. We denote the null space and the column space of a matrix A by Af{A) 
and C{A), respectively. By the symbol 6^ we denote the set of s x s non-negative definite 
matrices and by ^ we denote the Loewner ordering of matrices in ©^, i.e., A < B ii B — A is 
non-negative definite. Let x = (a;i,..., be a vector with non-zero components, then by 
x~^ we denote the vector x~^ := By diag('(;i,... ,Vk), where vi,... ,Vk are 

column or row vectors, we denote the diagonal matrix with diagonal elements corresponding 
to the elements of vi,... ,Vk- 

1.2 Statistical Model 

Consider an experiment consisting of N trials, where in each trial we choose one of v treatments 
(u > 2). The response of the i-th trial is determined by the effect of the chosen treatment 
u{i) and by the effects of nuisance experimental conditions t{i) from a finite set T, |T| =: n < 
oo. 

We assume that the model is additive in the treatment and nuisance effects and that it 
can be expressed as 

yz = Tu(^) + h'^ + Ei, i = (1) 

where Yi,..., Yn are the observations, 6* is a d x 1 vector of nuisance parameters, /i : T —^ 
is the regressor of the nuisance experimental conditions, and £i,..., Eat are independent and 
identically distributed random errors with zero mean and variance cr^ < oo. Suppose that 
we aim to estimate a system of s contrasts Q^t, where t = (ti, ... and Q is a u x s 

matrix satisfying Q'^ly = Og. We will assume that Q has full rank s, unless stated otherwise. 


4 


Moreover, we will assume that we are interested in all treatments 1,..., u, i.e., each treatment 
is present in Q (no row of Q is Oj). We consider 0 to be a vector of nuisance parameters. 
The model © can be expressed in the linear regression form 


Y, = f’^{x^)l3 + Ei, i = 

where Xi = G X, X = {1,..., u} x T, f{u, t) = (ej, h'^, /3 = (r^, The 

objective of the experiment is to estimate a system of contrasts it'^/3, where = (Q^, Osxd) • 
Let the approximate design of experiment (or, in short, design) be a function ^ : X —>■ 
[0,1], such that ^ ^{x) = 1, where ^(x) represents the proportion of trials to be performed 
in X G X. Hence, an exact design of experiment of size N is represented by a function 
^ : X —>■ {0,1/7V, 2/iV,such that ^ ^(x) = 1, where iV^(x) is the number of trials in 


the design point x G X. 

The inf ormation matrix o f the design ^ for estimating jS is the non-negative definite 
matrix (see 


Pukelsheim 


2006|) 


Nk{0 — 


( 2 ) 


where M(^) =J2xex ^(x)/(x)/^(x) is the moment matrix of the design ^ and the minimiza¬ 
tion is taken with respect to the Loewner ordering ^. It is well known that the system K'^j3 
is estimable if and only if C{K) C C{M{^)). When iG^/3 is estimable under we say that ^ 
is feasible for fi. In such a case, the information matrix of ^ is Nk{C) = M~, 
where M~{^) is a generalized inverse of M{^). 

Let $ : —>■ K be an optimality criterion. Then, a design is said to be $-optimal 

if it maximizes ^(^Nk{0) among all feasible designs A widely used class of optimality 
criteria are the Kiefer’s $p criteria. Let H he a positive definite s x s matrix with eigenvalues 
..., As (id), and let XminiH) be the smallest eigenvalue of H. Then, 


%(H) = 


(iEy(H))‘", 

(ftA,m)‘", 


j=i 
-^min (H), 


p G (-00,0), 

p = 0, 

p = —oo. 


If H is singular, we set d)p(id) = 0. For p = 0, —1 and —oo, we obtain the D-, A- and E- opti 
mality criterion, respectively. Note that criteria are information functions (see 


Pukelsheim 


200(ll |L in particular they are Loewner isotonic, positively homogeneous and concave. 
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We will investigate further the properties of experimental designs in model ([T]). The 
moment matrix of a design ^ may be expressed in the form 


where 


MiO = 


MniO MMO 
MUO M22{0 






Mii(0 = diag 
Mi2(C) = 

M22{0 = 

tel \u=i ) 


Civ,t) 


Ktei / 


Let us denote the Schur complement of the moment matrix M(^) as Mt{^) = — 

Mi 2 (^)M 2 ^(^)M 2 i(^). It is simple to show that the system iL^/3 is estimable under a design 
^ if and only if C(M^(^)) C C{Q). If ^ is estimable under the information matrix of ^ 

isNK{0 = {Q^M-{0Q)-^. 


2 Optimal Approximate Designs 

2.1 Preliminaries 

We say that w is a treatment proportions design if it is a design in the marginal model without 
nuisance effects 

Yi '^u{i) T ^ I; • • ■ ) ^' (^) 

That is, w is a function from {1,..., u} to [0,1] satisfying w{u) = 1. For a design ^ of 
o, the marginal design defined by w{u) = ^ represents the total weights of 

individual treatments in fy and it will be called the treatment proportions design of Since 
a design w always provides v weights r(;(l),..., w(v), we will often equivalently denote re as a 
z; X 1 vector of weights w = (wi, ..., Note that if ^ is an exact design of size N and w 

is its treatment proportions design, then Nw is the vector of replications of treatments in 
The properties of a treatment design w in model ([3]) are generally very simple to analyze. 
For instance, it is straightforward to show that the moment matrix of w is M{w) = diag(r(;). 
Moreover, the set of contrasts Q"'"t is estimable in ([3|) under w if and only if > 0 for all u. 
In such a case, the information matrix of w is evidently Nq(w) = ((3^diag(w’ -^)Q) '• 
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Similarly to the treatment replications design, we define nuisance conditions design a to 
be a design in the marginal model without treatments 


Yi = {t{i))9 + Si, i=l,...,N, 


(4) 


i.e., a is a function from T to [0,1] that satisfies = 1- F'or a design ^ of ([T]), the 

marginal design a{t) = ^ defines the proportions of trials to be performed 

under particular nuisance conditions, and it will be called the nuisance conditions design of 

e 


Proposition 1. Let f be a design in model m and let w be its treatment proportions design. 
Then, NKiO ^ Nq{w). 


The proof of Proposition |T] and all other proofs are deferred to the appendix. The propo¬ 
sition shows that by introducing nuisance effects, the information about the contrasts of 
interests can not increase. However, for a large class of designs, the information is exactly 
retained. 

We will say that a design f with its treatment design w is resistant to nuisance effects, or 
nuisance resistant for a given system of contrasts Q, if it satisfies 


^7 E 


7^ E ^iv,t)h{t) 


Q = 0. 


(5) 


The following proposition justihes this dehnition. 


Proposition 2. Let f be a nuisance resistant design with its treatment proportions design 
w > 0, then (i) ^ is feasible for K'^ff, (ii) K'^M~{£)K = w)Q and (Hi) f has the 

same information matrix as w, i.e., Nk{£,) = Nq(w). 

Note that the conditions of resistance to nuisance effects have also another desirable prop¬ 
erty: they are invariant with respect to a regular reparametrization of the nuisance regressors. 
That is, a design is resistant to nuisance effects with respect to nuisance regressors h if and 
only if it is resistant to nuisance effects with respect to nuisance regressors h = Rh, where R 
is any non-singular d x d matrix. 


In general, the class of designs resistant to nuisance effects depends on the chosen system 
of contrasts Q. Nevertheless, as we show, there is a large subclass of nuisance resistant designs 
that is invariant to the choice of Q, i.e., these designs satisfy ([S]) for any system of contrasts. 
We will say that a design ■C of (U) with its treatment design w > 0 is balanced if it satisfies 


4./~rr 


1 

W2 


^e(2,t)h(t) 

tGl 


Wv 


( 6 ) 
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If ^ is balanced, then for any k G {1,... ,d} the vector 

satisfies Sk = ak^v for some ak G K. Since Q is a matrix of contrasts, we have = Oj and 
hence sjQ = 0^. It follows that a balanced design ^ is indeed nuisance resistant. 

When the matrix of contrasts Q attains the maximum rank, u — 1, the null space Af{Q’^) 
has dimension 1 and it consists of vectors of the form al„ for a G R. Hence, for such Q, the 
balanced designs are the only nuisance resistant designs; i.e., in this specific but frequent case, 
the notions of resistance to nuisance effects and balancedness coincide. One consequence of 
this observation is that for given nuisance regressors h, the class of balanced designs is the 
intersection of the sets of nuisance resistant designs with respect to all possible choices of 
contrast matrices Q. 

We remark that the conditions ([S]) mean that a design ^ is balanced with respect to the 
nuisance effects. That is, for each regressor hk, the weighted average of the values hk{t) 
with weights ^(u,t)/u>„, t G T, is the same for each treatment u. Balancedness can also be 
understood geometrically: assume that for each u we calculate the barycentre of the n points 
h{t) G with weights ^{u,t)/wu, t G T. Then, these barycentres must be the same for all 
treatments u. 

A typical experimental situation is that we need to perform the same number of trials, 
usually one, under each nuisance condition t G T. In this case, it is straightforward to show 
that the conditions ([6]) may be expressed in a more compact form as follows. 

Proposition 3. Let ^ be a design which assigns the same weight to each nuisance condition, 
i.e., the nuisance conditions design of ^ is a = Inin. Then, f satisfies ^ if and only if it 
satisfies 

— f{u, t)h{t) = — h{t) for all u G {1,..., ?;}. (7) 

Wu n 

tGS iel 

In the case of an exact design f assigning one trial to each nuisance condition, the bal¬ 
ancedness of f has a straightforward interpretation. Suppose, for instance, that the nuisance 
conditions represent time and hi{t) is proportional to the room temperature at time t. For 
each treatment u, let T„ be the average temperature at the times of trials with the treatment 
u. Then, the balance conditions for hi{t) mean that the temperature conditions are “fair” for 
all treatments in the sense that the average temperatures are the same: = T for all u. 


Let zii be a treatment proportions design and a be a nuisance conditions design. Then, a 
design ^ is the product design of w and a if it satisfies 

^(u, t) = w{u)a{t) for all M G {1,, n}, t G T, 

which we denote ^ = w^a. Note that any product design w®a satisfies t)h{t) = 

a{t)h{t) for all u, therefore the product design is balanced and consequently, it is also 
resistant to nuisance effects. 


2.2 Conditions of Optimality 

The following theorem shows that the optimality of treatment proportions is a necessary 
condition of the optimality of a design in model O- 

Theorem 1. Let be an information function, let C be a ^-optimal design in model m 
and let w* be its treatment proportions design. Then, w* is ^-optimal in m- 

From Theorem[l]it follows that in order to find an optimal approximate design, we need to 
break up this process into two steps: obtain the optimal treatment weights and then optimally 
allocate these weights to nuisance conditions. Note that finding a ^-optimal treatment design 
is a convex u-dimensional optimization problem 

max $((Qdiag(r(;“^)Q^)“^), (8) 

Lt;>0, l'^w—1 

which can usually be easily solved numerically, and often analytically, as we demonstrate in 
Subsection 12.41 

Once the optimal treatment weights are obtained, we may allocate these weights to nui¬ 
sance conditions using the following theorem, i.e., by choosing a nuisance resistant design. 


Theorem 2. Let w* be a ^-optimal treatment proportions design. Let be a nuisance 
resistant design with its treatment proportions design w*. Then, C is ^-optimal and it has 
the same information matrix as w*, i.e., Nk{^*) = (Q^diag((r(;*)“^)Q) ^. 


The balanced designs are nuisance resistant, therefore, the balanced designs with $- 
optimal treatment weights w* are $-optimal. Moreover, they have the same information 
matrix as w*. Note that the set of optimal balanced designs is never empty, because it con¬ 
tains the set of product designs re* ® a with any a. Since a is any nuisance conditions design, 
the class of <l>-optimal designs for model © is very large (unles s n = 1). 


Similar results on optimality of product designs are given by iSchwabd |1996l | (cf. Theorem 
3.2) in a general additive model Yi = Po + fi {ui{i))l3i + /J(u2(i))/32 + £i- Note that general 
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nuisance resista nt designs, because they need not have product structure, are not covered by 


Schwabe 


1996|. 


Theorem [T] provides necessary conditions of optimality and Theorem [5] provides sufficient 
conditions of optimality. It turns out that for the wide class of strictly concave optimality 
criteria, we can provide conditions that are both necessary and sufficient for optimality of a 
design ^ in model ([T]). 

Theorem 3. Let $ fee a strictly concave information function. Then, a design f is ^-optimal 
if and only if (i) its treatment proportions design w is ^-optimal in model 0) and (ii) f is 
resistant to nuisance effects. 

Since the balanced designs are the only nuisance resistant designs for a system of contrasts 
of maximum rank, v — 1, we obtain the following corollary. 

Corollary 1. Let $ fee a strictly concave information function, and let Q be a matrix of 
contrasts of rank v — 1. Then, a design ^ is ^-optimal for estimating Q^t if and only if its 
treatment proportions design is ^-optimal in m and f is balanced. 


2.3 Rank Deficient Subsystems 

Until now, we always assumed that the v x s matrix Q has full rank. However, there are 
some frequently used sets of contrasts that do not satisfy this assumption. Such subsystems 


Pukelsheim 

2nnfil. 

by the matrix Q = 

Pukelsheim 

198.ll') 


by the matrix Q = — ^Jy which aims at estimating the centered effects of treatments (see 


In the rank deficient subsystem s, the information matrix Nfcif) is not well defined. In- 


Pukelsheim 


20^ . for a feasible design we define the matrix Ck{£,) '■= 


stead, following 

where the superscript + denotes the Moore-Penrose inverse. For K = 
(<3^,0)^, we get C'ic(^) = {Q^M~. Then, if ^{N) depends only on the eigenvalues of 
N, instead of maximizing $ defined on all eigenvalues of Nifff,), we maximize $ defined on 
the positive eigenvalues of Ck{£,)- 

For the full rank subsystem, the eigenvalues of the information matrix are the 

inverses of the eigenvalues of K"’'M~ (ff)K. Similarly, the matrix Cxif) satisfies that its non¬ 
zero eigenvalues are inverses of the non-zero eigenvalues of the matrix K'^M~{f)K. Thus, 
at least in the sense of their spectra, the matrices CKiff) are an analogue to the information 
matrices for full rank subsystems. 
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In the rank deficient case, results analogous to the full rank case hold. We will show 
that by introducing the nuisance effects, we cannot increase information about the treat¬ 
ment contrasts, as measured by Ck{C)- The ordering of matrices Ck{^) is induced by the 
inverse ordering of the matrices K'^M~{^)K. For any design we obtain K'^M~ {^)K = 
M~{^)Q and for its treatment proportions design Q'^M~{w)Q = Moreover, 

Mr{C) = — M^-}iC)Mnn{£.) MTn{£.) ^ therefore there exist generalized inverses 


that satisfy (^) ^ (see Wu 1980|) and it follows that K'^M {^)K >: Q^M {w)Q. 


As ^{N) depends only on the eigenvalues of N and the Moore-Penrose inverse has in¬ 
verse non-zero eigenvalues of AT, it implies that $((Ar^M“(^)Ar)+) > {w)Q)+), 

i.e., > <I’(Cq(u;)). 


From part (ii) of Proposition [5] it follows that any nuisance resistant design ^ has the same 
matrix Ck{C) its treatment proportions design, i.e., Ck{0 = Cq{w). Hence, Theorems [T] 
and [2] hold even in the rank deficient case. 

Theorem 4. Let $ he an information function and let Q be a v x s matrix of contrasts with 
rank((5) < s. Let w* be a ^-optimal design for estimating Q^t in model Then, the 

following holds 

(i) Any nuisance resistant design whose treatment proportions design is w*, is ^-optimal 
for estimating Q^t and Cxi^*) = (Q^diag((w*)“^)(5)^. 

(ii) Iff* is a ^-optimal design in model m and w* is its treatment proportions design, then 
w* is ^-optimal for estimating in model 

In particular, we obtain optimality of balanced and product designs with optimal treatment 
weights. 


2.4 Optimal treatment proportions for selected systems of contrasts 


We say that the system of contrasts Q^t is completely symmetric if QQ^ is completely 
symmetric. It is easy to show that such Q must satisfy = a(/„ — Jy/v) for some a > 0. 
We will show that some common systems of contrasts are completely symmetric. 

We consider information functions $ that are orthogonally invariant, i.e., ^{UNU'^) = 
$(A^) for any orthogonal matrix U. Note that a function $ is o rthogonal l y inva riant if and 
only if $(Af) depends only on the eigenvalues of N (see, e.g., 
details). 


Harman 


20041 for further 
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Theorem 5. Let t be a completely symmetric system of contrasts. Then the uniform 
treatment proportions design w = ly/v is ^-optimal for estimating Q^t with respect to any 
orthogonally invariant information function 


By a maximal system of orthonormal contrasts., we mean a set of z; — 1 contrasts that 
are orthogonal to each other and have norm 1, i.e., qi,..., qv-i satisfy q[qj = 0 for i ^ j 
and qj qi = 1 for all i. Note that a special case of the maxima l system of orthonormal 
contrasts are the Helmert contrasts (see, e.g., ICox and ReidI 2000|, Appendix C). Since Q 


is a z; X (u — 1) matrix of orthonormal contrasts, the matrix [Q, l„/i/z7| is orthogonal. It 
follows that Q^Q = Iv-i and QQ^ = Iv — Jv/v, thus it is a completely symmetric system. 
It is easy to verify that the information matrix of a treatment proportions design zc > 0 is 
Nq{w) = Q^M{w)Q — Q^M{w)JyM{w)Q and in particular Nq{w) = v~^Iy-i. 

Consider a system of centered treatment effects, or centered contrasts, which is the system 
of contrasts ti — f,... ,Ty — f, where f is the mean of the treatment effects. That is, Q = 
ly — Jy/V, which is a z" X z; matrix of rank z; — 1 and thus Q'^t is a rank deficient system. Note 
that QQ^ = Q is completely symmetric and hence the c entered contrasts a re a completely 
symmetric system of contrasts. In Section 5 of the paper Pukelsheim IQSsj . this system of 


contrasts was analyzed in great detail for a special case of model o, the block designs, and 
the optimality of product designs with uniform treatment weights was obtained. The matrix 
Cq{w) of the uniform treatment design satisfies Cq{w) = (vQ'^Q)^ = v~^Q = ly/v — Jy/v"^. 

By a system of all pairwise comparisons w e mean the system of Ti — Tj for all i > j 
(considered in, e.g., Bailev and Cameron 2009| b The corresponding v x matrix Q 

satisfies QQ^ = vly — Jy and thus the system is completely symmetric. 

Corollary 2. The uniform treatment design w is ^-optimal for estimating the system of 
orthonormal contrasts, the system of centered contrasts as well as the system of all pairwise 
comparisons, with respect to any orthogonally invariant information function $. 

Consider an experiment which aims at comparing two sets o f treatments. Exa ct de signs for 


these contrasts were studied in multiple de sign settings, e.g. in iMaiumdar 


2002 Ijn block expe riments 


Jacroux 


Jacroux 


Maiumdar 

1986 

and 

Jacroux 


1990 1 stud ied A- and MP-optimal designs in model (|3]), 


1993l | and ICithinii and JacrouxI 1998l| considered the presence of trends. Without 


loss of generality, let the first set consist of the first g (control) treatments, 0 < g < v/2, and 
the second set be the set of the remaining v — g treatments. Then the aim is to estimate all 
treatment comparisons Tj — Ti, where 1 < i < g and g + 1 < j < v, whic h leads to matrix 
Q = {—Ig 0 l«-g, Ig 0 ly-gY", where 0 denotes the Kronecker product. In lMaiumdan |l986 | 
the author suggests that such situation may arise when comparing two ’packages’ of treatments 
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or in comparing a set of new treatments with a set of standard (control) treatments. We will 
call such system of treatment contrasts comparison of treatments with controls. This system 
of contrasts naturally generalizes the standard system for comparison of u — 1 treatments with 
one control, r 2 — ti, ..., — ri, where g = 1 . 

Theorem 6 . Let p G [—oo, 0]. If p > —oo, let 7 p be the unique solution of the equation 

{v-g- 1)7^-^ - (5 - 1)(1 - 7)'“'’ + 27 - 1 = 0 (9) 


in the interval (0,1/2] and let 7 _oo = 1/2. Then the treatment proportions design that satisfies 
Wi = ... = Wg = ^p/g and Wg+i = ... = = [\ — 7 p)/(u — g) is ^p-optimal for eomparison 

of {v — g) treatments with g eontrols, 0 < g < u/ 2 . 


We note that for any p G (— oo, 0] and v > 2 there exists a unique solution jp of the 
equation ([HI) in the interval ( 0 , 1 / 2 ), which is moreover numerically easy to calculate, because 
the function F{'y) = {v — g — 1 ) 7 ^“^ — {g — 1)(1 ~ + 27 — 1 is an increasing convex 

function for 7 G (0,1/2) that satisfies F(0) < 0 and F(l/2) > 0. 

The obtained optimal treatment proportions which depend on the choice of criterion <l>p 
generalize the results obtained for block designs bv lGiovasmoli and WvnnI 19851 ] on comparison 
with (one) control. The optimal proportions given by Theorem| 6 ]are characterized by a single 
value, 7 p, the total weight of the first g treatments. In particular, for H-optimality, 70 = gjv 
and the optimal proportions are uniform; for A-optimality, 7 _i = ~ ^tiich lies in 

( 5 /u, 1/2); and for i?-optimality, 7 _oo = 1/2, i.e., to each of the two sets of treatments, half 
of the total weight is allocated. 


For comparison with controls, it is common to also use the criterion of My-optimality 
which minimizes the maximum variance of the contrasts of interest. It turns out that MY - 
optimal and A-optimal treatment proportions are the same. It follows that the A- and MY- 
optimal nuisance resistant designs are the same. 

Theorem 7. Let 7 = let wx = ... = Wg = ^ jg and Wg+i = ... = Wy = 

(I — "f)/{v — g). Then w is MY-optimal for comparison of {v — g) treatments with g controls, 
0 < g < v/2. 

Once optimal treatment proportions are calculated, optimal nuisance resistant (balanced, 
product) designs can be constructed, retaining the same information matrix as their treatment 
proportion designs. Note that the matrix Q for completely symmetric contrasts and for 
comparison with controls has rank v—1, therefore for such systems, the balanced designs and 
nuisance resistant designs coincide. 
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3 Examples 

3.1 Trend Resistant Designs 

Let us consider a model where we perform the trials in a time sequence, in each time exactly 
one trial, and the nuisance effect is the effect of some time trend 


Yi = Tu(i) + hi{t{i))9i + ... + hd{t{i))9d + £*,*€ {1, 


( 10 ) 


where u{i) G {l,...,u} represents the chosen treatment and t(i) G {1 ,...,7 t,} denotes in 
which time the treatment is to be applied in trial i. The functions /ii,..., /id : K. —>■ R are the 
regressors of the time trend, often chosen to be polynomials of degrees 0 ,..., d— 1 respectively. 

The interest in de signs that perform well under model (1101) dates back to the mid-20th 
century, e.g., in paper 


Cox 


195l| . The research focus is usually on combinatorial construction 


of exact designs orthogonal to time trend (or trend free). These are designs that satisfy that n o 
information is lost due to the time trend (see, e.g.. IJacroux et al.l |1997l |. lBailev et all 1992| L 


Usually, the focus is on all parameters of interest, not on a system of contrasts Q, resulting in 
the cond ition that a design is trend free with respect to hk in model (fTUl) if t)hk{t) = 0, 


Cox 


1951 |. Such trend free designs satisfy Mi2(^) = 0 and thus Mr{^) = Mii(^). 


see, e.g. 

The drawback of the combinatorial approach is that it is usually tailored for a very specific 
model. For example, the theoretical results on orthogonal designs require the number of design 
points to be a multiple of the number of treatments, the time points to be evenly spaced and 
the time trend needs to be represented by a polynomial. However, these conditions often do 
not hold. The reader may find a s urv ey of the literature on the t rend resistant experimental 


Cheng 


199d| or 


Atkinson and Donev 


1996|. 


designs in the papers 

Note that the orthogonal designs satisfy m and thus they are balanced. However, since 
we aim at estimating a set of treatment contrasts Q, the stringent conditions of orthogonality 
need not hold for the information to be retained. If ^ is resistant to nuisance effects, the 
equality Mr{C) = Afii(^) in general does not hold, but such ^ satisfies Nk{C) = ^q{w), i.e., 
the designs resistant to nuisance effects eliminate the effects of the time trend. We remark 
that when h{t) = 0, the conditions of orthogonality and the conditions of balancedness 
coincide. 


We will examine the model with trigonometric ti me trend of degree D G N, which can be 


used to model, for instance, circadian rhythms (cf. 


Kitsos et al 


1988|). For simplicity, let 
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(j)n = 27r/n and consider the model 

Yt = Tu{t) +Oo + 01 cos{(j)„t) + 02 sm{(j)nt) + ... (11) 

+ 02D-1 COs{D(j)„t) + 02D Sm{D(j)nt) + £t, 

where t = 1 , 2 ,..., n. 

An exact design ^ will be represented by a sequence of treatments determining which treat¬ 
ments are to be chosen in which times. Note that the regression functions satisfy hk{t) = 0 
for k > 0, i.e., the notions of orthogonal and balanced designs for this model coincide. 

Using Theorem ^ we get that by repeating a sequence of treatments with <i>-optimal 
treatment weights, we may obtain a ^-optimal design for model (ITT]) of high degree. 

Proposition 4. Let $ be an information function. Let I G N and let be an exact design 
of size I with ^-optimal treatment proportions for estimating contrasts Q'^t. Let m £ N. 
Then, the exact design ^ = ^p^p...^p of size n = Im formed by an m-fold replication of fp is 
^-optimal for all trigonometric models (ED of degrees D < m. 

It is in fact possible to show that the design ^ from Proposition |T] is cf>-optimal for models 
of the type m of any degree, but they cannot include the terms cos(a(/)„t) and sin(a^nt), 
where a is an integer multiple of m. 

We demonstrate the results given by Proposition |T] on a simple example. 

Example 1. Consider the model 

Yt = Tt + 00 + 01 sin(r) -|- 02 cos(r) -|- 0^ sin(2r) -|- 04 cos(2r) -|- 05 sin(3r) -|- 06 cos(3r) -|- et, 

where r = ^t and t = l,2,...,n. Let n = 16, v = 3 and fp = 2113. Then, the design 
= 2113, 2113, 2113, 2113 is E-optimal for comparison with one control. Let n = 12, v = 3 
and = 321. Then, the design ,^2 = 321,321,321,321 is H-optimal for comparison with 
one control. Moreover, let $ be an orthogonally invariant information function. Then, ^2 is 
^-optimal for estimating any completely symmetric system of contrasts. □ 

3.2 Block Designs, Row-Column Designs 

Consider an experiment, where the treatment units are arranged in b blocks. As usual, for each 
of the N treatment units, we choose one of v treatments. The response is then determined 
by the treatment effects and block effects. We assume that the treatment and block effects 
do not interact, i.e, we obtain an additive blocking experiment 

Yi = Tu(i) + + £i, i = 1,..., N, (12) 
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where u{i) S {1,..., z;} and t{i) G {1,..., 6 }. The designs of blocking experiments are called 
block designs. There is a large amount of literature on this topic, in particular the pa 


pers that consider treatment contrasts in block designs are, e.g.. iMaiumdar and Notz 


Pukelsheim 


198 


i- 


198 


i, 


Note that model m may be expressed as a special case of model o, where T = { 1 ,..., 6 }, 
n = b, 0 = (? 7 i,..., and h{t) = et € is the t—th elementary unit vector. For 
block designs, in the balance conditions ([5]) we obtain = ... = ^{v,t)/wv for all 

t G {1,..., 6 }, which leads to a product design ^ = w ^ a. That is, all balanced designs in 
model (1121) are product designs. Therefore, for a system of contrasts of rank w — 1 and a 
strictly concave information function $, from Theorem [3] it follows that all $-optimal designs 
are product designs. Note that, in general, the balanc ed incomplete block designs and the 
balanced treatment incomplete block designs (see, e.g.. IMaiumdar and Nota 1983j) are not 
balanced in the sense of conditions [01 


Block designs are often used for eliminating heterogeneity in one direction, e.g., caused 
by a nuisance time trend. If the position of a unit within a block affects the response as 
well, or in general, the hete rogeneity needs to be eliminated in two directions, we may use 
the row-column designs (see 


Jacroux 


1983)- Here, N experimental units are arranged in bi 


rows and 62 columns. The mean response is determined by the sum of the treatment, row 
and column effect, modelled as 


Yi — Tu(i) + ?7fc(i) + + Ci, i — 1 ,..., 


(13) 


where u{i) G {!,..., u}, k(i) G {1, . .., bi} and l{i) G {1,.. ., 62 } represent the row and column 
chosen for the z-th trial, respectively, and , fi(i) are the row and column effects. 

This model can also be expressed as a special case of model o, where T = {l,..., 6 i}x 
{ 1 ,...,& 2 }, n = & 162 , 9 = (? 7 i,..., 7 ?bi,(/)i,...,(()b 2 )^ and h{k,l) = G The 

balance conditions for the row-column model become Wi^ ^{l,k,l) = ... = w~^ ^{v, fc, 1) 

for all fc = 1,..., &i and k,l) = ... = w~^ 0 for alH = 1,..., 62 - That 

is, for any row (column) the ratio of the total weights of any two treatments i, j in the partic¬ 
ular row (column) is given by the ratio of the treatment weights Wijwj. In other words, for 
the design ^ to be balanced (and hence optimal, if ^ attains optimal treatment weights), the 
functions ^(zt, •, ■)/wu need to have the same row and column marginals for all u = 1 ,..., u. 

The block and row-column designs are c alled the desig ns for the one-way and two-way 
elimination of heterogeneity, respectively (see 


Jacroux 


19821 '). By combining the models (1121) 


and the blocking experiment under the presence of a nuisance time trend is obtained. 
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see, e.g. 


Bradley and Yeh 


3 


198(111 or 


Jacroux et al 


1997| . which we will examine further in 


Example ID 


4 Constructing Efficient Exact Designs 


By constructing product designs with optimal treatment weights, and calculating their cri¬ 
teria! values (or by analytically deriving optimal criterial values), we may assess the quality 
of the exact designs. More precisely, we can compute lower bounds on the efficiency of any 
given exact design by calculating its approximate efficiency with respect to the criterion d*, 
eff(^) = where is a ^-optimal approximate design. Moreover, as we demonstrate in 

this section, the balance conditions provide a tool for obtaining optimal approximate designs 
with small support and these designs can be used to construct efficient exact designs. 

We will focus on exact designs of experiments in which exactly one trial is to be per¬ 
formed under each nuisance condition. The problem of fin ding such optimal des i gns is in 
general a difficult discrete optimization problem, see, e.g., lAtkinson and DonevI |l996l | or 


Harman and Sagnol 


2015 1. 


Note that both the balance conditions (and, in general, the conditions of resistance to 
nuisance effects) and the conditions on $-optimal weights are linear. Hence, results provided 
in the previous sections can be used to calculate a balanced approximate design with ^-optimal 
weights employing linear programming, solving the problem 

min{c^a;|Ha; = 6, a; > 0}, (14) 

where x G K'"" represents a design ^ in the vector form, A consists of sufficient conditions 
of optimality and we are free to choose the the vector c of the coefficients of the objective 
function. Let us denote the set of all feasible solutions of (|14|) as V. 

The matrix A consists of 

(i) V equalities = Wu, u = l,...,ri, i.e., ^ attains the ^-optimal treatment 

weights, 

(ii) d{v — 1) equalities ^(1, t)/i(t) = w~^ u = 2,...,v, i.e., ^ is a 

balanced design, 

(hi) n equalities i-®-i under each nuisance condition exactly one trial is 

performed. 
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Once the ^-optimal treatment weights w* are obtained, a <i>-optimal design can be con¬ 
structed as a product w* (E) a for any nuisance conditions design a. However, in general, it 
is difficult to construct exact designs from the product designs, due to their regular structure 
and large support. To obtain an optimal design with small support, it is beneficial to em¬ 
ploy the simplex method of linear programming, whose output is an optimal design that 
represents a vertex in P, the set of feasible solutions of (IT^ . 

Proposition 5. Let ^ represent a vertex in V. Then, ^ contains at most v + {v — l)k -|- n — 1 
support points, where k is the affine dimension of the set 

From Proposition it follows that by employing the simplex method, we can obtain a 
^-optimal design with at most {v — l){k -I- 1) -|- n support points. As a special case, when 
a constant term 0q is present in the time trend, it may be ignored in the conditions in (1141) . 
because it does not increase the affine dimension of {h(t)}ti^%', reducing thus the upper bound 
on the number of support points by u — 1. 

Suppose that ^ satisfying (iii) has support of size n, the number of nuisance conditions. 
Then, ^ uniquely determines an exact design of size n. The number of support points in 
designs obtained by the simplex method is only slightly larger than n; it exceeds this minimum 
support size by {v — l){k + 1). Note that the number of exceeding support points does not 
depend on n, thus, even for increasing number of nuisance conditions, it remains sma h. _ 

We note that using the Caratheodory Theorem (cf. Theorem 8.2. in iPukelsheimI 200g), 
it is possible to obtain results similar to Proposition [5l but the Caratheodory Theorem does 
not provide an actual method of constructing a design with small support, unlike the simplex 
method. 


remains small. 


Pukelsheim 


2006 


From an optimal approximate design with small support, an efficient exact design can be 
constructed by rounding, or often even by a complete enumeration of treatments in a small 
number of nuisance conditions. 


Example 2. Consider an experiment of performing trials in a time sequence 

Yi = ru{i) +0o + 0ihi{t{i)) + Si, i = l,...,n, 

where hi{t) = e-' represents an exponenti al time trend (e.g., the deca y of wool in the 

Assume that 


Atkinson and Donev 


1096l |l. 


experiment of wool processing, as suggested by 
V = 5 and the objective is to find an A-optimal design for comparing 3 treatments with 2 
controls, i.e., 7 _i = \/6 — 2 fa 0.45. We will provide optimal balanced designs with small 
support by employing the simplex method of linear programming {linprog function of Matlab, 
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using the simplex algorithm). Note that also the interior point (default) algorithm of Matlab’s 
linprog tends to provide optimal vertex solutions or optimal solutions with small support; as 
such it can be used instead of the simplex algorithm. 

First, let n = 8. Since Oq is the constant term, from Proposition [5] it follows that there are 
at most n—l + v + {v—l)D= 16 linearly independent rows of A in (fHl) . where D = d—1 = 1. 
We remark that although we are free to choose the vector c in the linear program, the support 
size of the design obtained by the simplex method does not seem to depend on the choice of 
c. Therefore, we chose each of the elements of c uniformly randomly from (0,1). 

We obtained a design that has the support of size 16 (and the minimum support size 
is 8) and is “fixed” in 4 times (i.e., in each of these times has only one non-zero element), 
see Table [TJ The support size corresponds to the bound 16 given by[SJ 


u\t 

1 

2 

3 

4 

5 

6 

7 

8 

1 

0 

0.1250 

0.0560 

0 

0 

0 

0 

0.0437 

2 

0 

0 

0.0690 

0 

0 

0.1250 

0.0059 

0.0249 

3 

0.0245 

0 

0 

0.1250 

0 

0 

0 

0.0340 

4 

0.0154 

0 

0 

0 

0.1250 

0 

0.0207 

0.0224 

5 

0.0851 

0 

0 

0 

0 

0 

0.0984 

0 


Table 1: A-optimal balanced approximate design obtained by the simplex method. The first 
two treatments are controls. 

By a complete enumeration of the possible treatment combinations in the remaining 4 
non-fixed times, we chose the design = 51234215 that maximizes the criterial value. For 
V = 5 and n = 8 it is possible to find the A-optimal exact design by a complete enumeration, 

= 41253214. It turns out that the design has efficiency 1 relative to (*, i.e., it is optimal; 
in fact, can be obtained by relabelling treatments 3,4,5 in ^*. Note that, in general, the 
proposed heuristic does not provide optimal exact designs. 

For n = 100 and the same model assumptions, we obtained a design ^2 with support of 
size 108. That is, the number of support points of ^2 exceeds the minimum support size again 
by 8; moreover ^2 has only 5 non-fixed times. Therefore, even for n = 100, an efficient exact 
design may be constructed by a complete enumeration of treatments in the non-fixed times. 
The resulting design ^2 assigns 23, 22, 19, 18, 18 trials to treatments 1,...,5, respectively, 
which corresponds to the ^-optimal treatment weights given by 7 _i ft! 0.45. Moreover, ^2 has 
approximate efficiency 0.994; its efficiency relative to the optimal exact design would be even 


19 







higher, but for the problem of this size, it is infeasible to compute an optimal exact design by 
a complete enumeration. □ 

In the following example, we demonstrate for various values of v, n, d that the simplex 
method provides optimal approximate designs with small support. 

Example 3. Consider an experiment of performing trials in a time sequence which aims at 
comparing treatments with control and the observed values are a subject to a polynomial 
time trend 


Yi = Tu{i) + 9opo{t{i)) + eipi{t{i)) + ... + 0DPD{t{i)) + Si, i = 


where po,... ,pd are discrete orthogonal polynomials of degrees 0 ,..., Z?, respectively, i.e., 
= 0 for i 7 ^ j. Furthermore, we set po = 1 and Pi{l) = 1 for all i. Note that 
although the total number of time trend parameters is d = D 1, the term Oopolt) — 9 q 
represents the constant term and thus, from Proposition [5] it follows that there are at most 
n + (u — 1)(I? + 1) linearly independent rows of A in (|T^ . 

For varying v, n and D, we calculated an A-optimal design for comparison of treatments 
with one control using the simplex method and we compared the size of its support with the 
minimum size of the support and with the theoretically derived bounds given by Proposition 
[5] (see Table [2]). 

□ 


Bradley and Yeh 


19^. We have a blocking 


Example 4. Consider the model given by 
experiment of b blocks, each of size I, where the response of a trial is also influenced by a 
common trend effect determined by the position of the unit within the block. In each block, 
there is exactly one trial performed on each position. Moreover, the trend effect in position 
t 2 {i) does not depend on the particular block tiii). We have 


Y^ =Tu(i) + p^ {t2{i))p + Si, i = l,...,n. 


(15) 


where ti(i) G {!,...,&} is the block in which trial i is performed, rjt^ is the effect of the 
ti-th block, t 2 G {1, ■ ■ ■ -,1} denotes the position of the unit within the block, n = bl, ip is a, 
(Z? + 1) X 1 vector of nuisance trend effects and p : R —^ is a regression function of the 

nuisance trend. 

Assume that v = 3, 6 = 3 and l = 8 ,n = 3x8 = 24 and that the time trend is modelled 
by discrete orthogonal polynomials po,pi,p 2 of degrees 0,l,2,i.e.,Zd = 2. We aim to find an 
ZH-optimal design for comparing treatments with one control. The optimal weight of the first 
treatment is 7 * = 1/2 and the optimal weights of the other two are 1/4. 
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V 

n 

D 

Simplex 

Max. Simplex 

3 

120 

1 

124 (4) 

124 

3 

150 

1 

154 (4) 

154 

3 

200 

1 

204 (4) 

204 

4 

120 

1 

126 ( 6 ) 

126 

5 

120 

1 

128 ( 8 ) 

128 

8 

120 

1 

134 (14) 

134 

3 

120 

2 

126 ( 6 ) 

126 

3 

120 

3 

128 ( 8 ) 

128 

3 

120 

4 

130 (10) 

130 

3 

120 

5 

132 (12) 

132 


Table 2: The size of the support. For a given number of treatments v, number of times n and 
degree of the time trend D, the column Simplex contains the size of the support of the design 
calculated using the simplex method (and the number of support points over the minimum 
size of the support, n, in parentheses); the column Max. Simplex contains the theoretical 
bound on the maximum number of support points given by Proposition!^ Note that for each 
of the studied cases the theoretical bound on the support has been exactly achieved. 


The conditions (ii) in A can be expressed as two sets of conditions: (ii.a) {v—l)b conditions 

for u = 2 ,... , 1 ; and h = and (ii.b) {v- 

!)(£) + 1 ) conditions ^ 2 )p(t 2 ) = ti, t 2 )p(t 2 ), u = 2,...,v. 

By summing (ii.a) over all ti, and using the fact that = 1) we obtain (i), which 

reduces the number of linearly independent rows in A by u. Similarly to Proposition [5l using 
(i), it follows that there are at most (u — 1)(& + r) + n — 1 linearly independent rows in A, 
where r is the alEne dimension of the set {p{t 2 )}t 2 - Since po^po represents the constant term, 
the number of linearly independent rows in A is at most {v — 1)(6 + D) + n — 1 = 33. The 
minimum number of support points is n = 24. 

Using the simplex method, we obtained an if-optimal balanced approximate design 
see Table m The design has the support of size 30, which exceeds the minimum support 
size by 6 , and it is fixed in 18 out of the 24 positions. 

By a complete enumeration of treatments in the 8 non-fixed positions, we obtained an exact 
design ^ : 6 i = 12131213, 62 = 11312132, 63 = 32231111, where the sequence bj determines the 
treatments and their positions in block j. Using Theorem [ 6 l we get that ^ has approximate 
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block 

u\t 

1 

2 

3 

4 

5 

6 

7 

8 

1 

1 

0.0417 

0 

0.0417 

0 

0.0417 

0 

0.0417 

0 


2 

0 

0.0417 

0 

0 

0 

0.0417 

0 

0 


3 

0 

0 

0 

0.0417 

0 

0 

0 

0.0417 

2 

1 

0.0417 

0.0417 

0 

0.0324 

0 

0.0417 

0.0093 

0 


2 

0 

0 

0 

0 

0.0417 

0 

0 

0.0417 


3 

0 

0 

0.0417 

0.0093 

0 

0 

0.0324 

0 

3 

1 

0.0046 

0 

0.0083 

0 

0.0417 

0.0417 

0.0370 

0.0333 


2 

0 

0.0417 

0.0333 

0 

0 

0 

0 

0.0083 


3 

0.0370 

0 

0 

0.0417 

0 

0 

0.0046 

0 


Table 3: i?-optimal balanced approximate design obtained by a simplex method for an exper¬ 
iment with 3 blocks, each of size 8, and a common trend effect. 


efficiency 0.999. 


□ 


Appendix 


Proof of Proposition [T] 

Let us partition the matrix L in 

Nk{0 = 

sls L = where Li is an s x n and L 2 is an s x d matrix. Then, 

Nk{0 = niin = min {Li, L 2 )M{^){Li, L 2 )^ 

LK=I. (Li,L2)(Q^.0)t=7, 

^ min LiMii{^)L{ = Nq{w). 

LiQ=Is 


From now on, we assume that ,0sxd) ■ 


□ 


Lemma 1. Let M be a non-negative definite matrix. If a design ^ satisfies M{f)M K = K 
for some generalized inverse M~ of M, then (i) f is feasible for K'^fi and (ii) M~ {^)K = 
K'^M-K. 


Proof. The steps of the proof follow the proof of Theorem 8.13 from iPukelsheimI 2006j. We 
denote G := M~. Since M{f)GK = K, we obtain M{ff)X = K, where X = GK. Therefore 
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C{K) C C{M{^)) and hence ^ is feasible. Let us premultiply the equation M{^)GK = K hy 
K'^M~{^) so that we obtain on the right-hand side M~ {^)K. The left-hand side is then 
equal to M~{^)M{^)GK. Note that and hence the following 

holds 


K^M-{C)M{C)GK = x'^ {C)M{C)GK = X’^M{^)GK = K^GK. 

It follows that □ 

Lemma 2. Let w > 0 be a treatment proportions design and let G := diag(ui“^, Od) ■ Let ^ 
be a design in model m, then ^ satisfies M(fi)GK = K if and only if (i) w is a treatment 
proportions design off, and (ii) f is resistant to nuisance effects. 

Proof We may express M{f)GK = K as Mii(^)diag(ic“^)(5 = Q and Mj^(^)diag(r(;“^)Q = 
0. Since both Mii{f) and diag('u;“^) are diagonal matrices, and all rows of Q are as¬ 
sumed to be non-zero vectors, the first equation is equivalent to ~ ^ 

u, which is (i). From the second equation, we obtain that every row of M]^(^)diag(r(;“^) = 

it:T,t^iv,t)h{t) 

Proof of Proposition!^ 

Let M := diag(r(;,0^). Then, G := diag(r(;“^,0^) is a generalized inverse of M. From 
Lemma [2] it follows that M{f)GK = K and from Lemma [T] it follows that (i) and (ii) hold. 
The statement (iii) is a direct consequence of (ii). □ 


needs to be in Af{Q’^), which is (ii). □ 


Proof of Theorem [T] 

Let ^ be a feasible design in ([T]). Using Proposition [TJ we obtain that Nxif) di Nq{w), 
where w is the treatment proportions design of f. Moreover, since product designs are nuisance 
resistant, part (iii) of Proposition [2] implies that Nq{w) = Nk{w 0 a) for any nuisance 
conditions design a. Therefore, Nxif) ^ Nk{w 0 a). 

Suppose that w* is not a ^-optimal design. Then, there exists a design Wh under model 
([H) such that $(iV( 3 (w*)) < (l>(NQ(wb)). Then, <!>(Xx(f*)) < (I?(Xq(w*)) < <P(NQ(wh)) = 
<l>(Xx(u’b 0 a)) for any nuisance conditions design a. That is a contradiction with f* being 
^-optimal. □ 


200 


i). 


Let <i> 6e a strictly concave information 


Lemma 3 (Theorem 8.13 from iPukelsheim 
function and let f* be ^-optimal for iL^/3. Let G be a generalized inverse of M{f*) that 
satisfies the normality inequality of the General Equivalence Theorem (Theorem 7.14 from 
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Pukelsheim \ 2 OOt I). i.e., there exists a non-negative definite matrix D that solves the polarity 
equation 

^{NK{C))^°^{D)=tv{CD) = 1 , 

where is the polar information function of ^ fsee iPukelsheim f200aj }. and G satisfies the 
normality inequality 

tr(Af(^)i?) < 1 for all feasible designs 

where B = GKNK{f,*)DNK{f,*)K"^G"’". Then, a design is ^-optimal if and only if M{£f)GK = 

K. 


In order to use Lemma |31 we need to obtain a matrix G that satisfies the normality 
inequality of the General Equivalence Theorem. 


Lemma 4. Let ^ he a strictly concave information function, let w* be a ^-optimal treatment 
proportions design and let G := diag((ty*)“^,0^). Then, G satisfies the normality inequality 
of the General Equivalence Theorem for estimating K'^fi in model m- 

Proof. Let us denote N* := Nq(w*) and Gn := diag((t(;*)”^). Since w* is optimal in ([31), the 
matrix Gn that is the unique generalized inverse of M{w*), satisfies normality inequality of 
the General Equivalence Theorem for model ([3|), i.e. there exists a matrix D which satisfies the 
polarity equation = tT{N*D) = 1 and the matrix B^, = GiiQN*DN*Q"^Gii 

satisfies the normality inequality tr{M{w)Bw) < 1 for all w. 

There exists a unique ^-optimal information matrix because $ is strictly concave. 

Since Nk^w* G a) = Nq{w*) = N* is ^-optimal, we have Nxif*) = N*. Thus, the polarity 
equality holds in model © for the same matrix D. Let ^ be a feasible design. Then, the 
left-hand side of the normality inequality in model ([T]) is tr{M{f)B), where 


B = 


Gn 

0 


N*DN* 


0 


Gn 

0 


0 

0 


By, 0 
0 0 


Then, because By, satisfies the normality inequality in model ([3|), we obtain tr{M{f)B) = 
tr{Mii{^)By,) = ti{M{w)By,) < 1, where w is the treatment proportions design of □ 


Proof of Theorem 

Let ^ be a design in model m and w be its treatment proportions design. Since $ is 
isotonic, from Proposition [T] it follows that $(7 Vq(w)) > ^{Nk{£,)). Since w* is <l>-optimal, 
^{Nq{w*)) > ^{Nq{w)) and it is feasible, thus w* > 0. Using Proposition |31 we get that 
$(iVx(D) = ^{Nq{w*)) > ^{Nq{w)) > i-e-, T is ^-optimal. □ 
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Proof of Theorem [3] 

Let G = diag((t(;*)“^, Od). From Lemma[4]it follows that G satisfies the normality inequality 
of the General Equivalence Theorem. Lemma [3] yields that a design ^ is <i>-optimal if and only 
if M{S)GK — K. The equality M{^)GK = K holds if and only if ^ satisfies (i) and (ii) from 
Lemma [H □ 


Proof of Theorem 

First, assume that Q has full column rank. Let ic be a feasible treatment proportions 
design and let P be a z; x z; permutation matrix. We define Pw to be the design given by 
the P-permutation of treatments in w, i.e., Pw(u) = w{ttp{u)) for u G {1,... ,z;}, where TVp 
is the permutation of elements n} corresponding to the matrix P. Since Pw > 0, 

it is feasible, its moment matrix is M{Pw) = PM{w)P"^ and it has information matrix 
Nq{Pw) = {Q^PM-\w)P^Q)-\ 

We will use the well-known fact that if X is any matri x, the non-ze ro eigenvalues of 
the matrices X^X and XX^ are the same (e.g., 6.54(c) in ISeben 200g), including mul¬ 


tiplicities. Define Y = Q'^ and Z = PM~^/'^{w). Since QQ^ is completely 

symmetric, = Z"^Z. Furthermore, = Q'^M~^{w)Q = Nq^{w) and ZZ"^ = 

Q"^PM~^{w)P"^Q = Nq^{Pw), thus Nq{w) and Nq{Pw) have the same set of non-zero 
eigenvalues. Since they have the same (full) rank, it follows that Nq{w) and Nq{Pw) are or¬ 
thogonally similar and $(Pzz;) = $(zz;). Note that analogous results hold in the rank-deficient 
case for the matrices Cq(w) and Gq{Pw). 

The uniform treatment design satisfies 


\ P—perm. / P—perm. 

= ^ $(zn) = ^z;!$(zz;) = <i)(zc), 

y\ 7,1 


V\ 


P —perm. 

where the inequality follows from the concavity of $. Thus, w is ^-optimal. 


□ 


Proof of Theorem [6] 

Note that for Q = {—Ig 0 ^v-g, Ig <8 Iv-gY" we have 


QQ^ = 


9)Ig Jgx{v-g) 

~J{v—g)xg d^v—g 


25 







Let w be a treatment proportions design, let Pi, P2 he g x g and {v — g) x (v — g) permutation 
matrices, respectively, and let 


P = 


Pi hgx{v-g) 
P(v-g)xg P2 


( 16 ) 


Define Pw to be the design given by the P-permutations of the treatments. Then M{Pw) = 
PM{w)P"^ and P^QQ^P — QQ^. From an argument analogous to the proof of Theorem [51 
Cq{Pw) and Cq{w) are orthogonally similar and ^p{Pw) = $p(w). 

Define w = Pw, where the sum is over all r; x u permutation matrices P of 

the form (fTBl) . Then $p(w) > $p(ra). It follows that an optimal design exists in the class of 
designs that allocate one weight to each of the first g treatments, say 71 (0 < 71 < l/g), and 
another weight to each of the other treatments, 72 := (1 — gh)/{v — g). Let 7 := g^i be the 
total weight of the first g treatments and for a given 7, we denote such designs as Wj. 

The non-zero eigenvalues of Cq(w^) are inverse to the non-zero eigenvalues of V{w^) := 
where M(w^) = diag(7ilg,72l«-g). Let X = Q '^Then the set 
of non-zero eigenvalues of V(wP = X^X coincides with the set of non-zero eigenvalues of 


XX^ = M-P^{w^)QQ^M-P‘^{w^) 


{v-g)li^Ig -(7172) ^/Vgx(„-g) 
— (7172) J{y—g)xg 572 ^v—g 


It can be seen that XX^ has the following eigenvalues, listed with the corresponding eigen¬ 
vectors X = where xi G R® and X2 G pi = 57^^ with multiplicity (w.m.) 

V - g - 1 , xi = Og and l^_gX2 = 0 ; p 2 = (w - ghi^ w.m. g - 1 , ijxi = 0 and X2 = Ov-g', 
= (v - 5)71”^ + 912^ w.m. 1, xi = -{v - ghl^^lg and X2 = 57i^^l„_g; and p4 = 0 w.m. 
1, xi = 7 i^^lg and X2 = 

Therefore, the non-zero eigenvalues of Cq{wj) are Ai = g^PJg'f w.m. v — g — 1 , X2 = 
w.m. p — 1 , A3 = w.m. 1 . Thus for p G (—c», 0 ), the $p-optimal Wj is obtained by 

minimizing the convex function 


fpili) 


{v 


5-1) 


/ 1-571 

\ 9 {v- g) 


+ (5-1) 


7 i 

V- g 


+ 


f 71(1-571) 

V 'v-g 


p 


Then /^(7i) = 0 if and only if 


-{v - g- 1 ){ 1 -g-fif ^ + (5 - l)( 57 i)^ ^ + (1 - 257 i)( 57 i)^ ^(l- 57 i)^ ^ = 0 


which is equivalent to 

-{v - g - l)( 57 i)^“^ + (5 - 1)(1 - 571)^"^ + 1 - 2571 = 0- 
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Using 7 = (771, we obtain 

If we set p = 0 in we obtain 7 = gjv^ which means that is a uniform design. Such 
design is indeed D-optimal, because it is well known that the uniform design is Z?-optimal for 
any system of contrasts. 

The smallest non-zero eigenvalue of Cq(wj) is A3 and hence the $_00-optimal design can 
be obtained by maximizing 


/-oo( 7 ) 


7(1 - 7) 
9 {y- 9) 


which has maximum in 7 = i. 

Note that even in the case g = 1 , where Q is not rank deficient, the eigenvalues of V{w) 
are inverses of the eigenvalues of Nq (w) and thus our results hold. □ 


Proof of Theorem 0 

This proof will closely follow the proof of Theorem [6l The covariance matrix of the 
least-square estimators is proportional to V{w) = Q'^M~^{w)Q. Note that since the MV- 
optimality criterion ^mv depends only on the diagonal of the variance matrix, it is permu- 
tationally invariant. 

Let w be a treatment proportions design and let P, w, 71, 72, 7 and Wj be defined as in the 
proof of Theorem [SI Then Q^P = BQ^ ^ where B = Pi ^ P2, which is a permutation matrix. 
Thus V{Pw) = BV{w)B'^ , ^Mv{Pw) = ^mv{w) and ^mv{w) < ^mv{w). Similarly, 
^Mv{w') < ^Mv{w). It follows that an optimal design exists in the class of designs w-y. 

We have V{w^) = 'yi^Ig^Jy-g+^^^Jg^Iy-g and all its diagonal elements are 7r^ +7^^- 
Thus the optimal 71 may be obtained by minimizing 

fMvill) = -k -j- 

1-571 

which has minimum in yf = v/gl" , thus 7* = □ 

Lemma 5 . Let /,m G N, let be an exact design of size I and let ^ = ^p£,p---£,p be the exact 
design of size n = Im formed by an m-fold replication of ^p. Assume that a G N is not an 
integer multiple of m. Then, ^ is balanced for the nuisance regressors of the form cos(a 0 „t) 
and sin{a(j)nt), t = 1 , ..., n. 

Proof. Let m £ { 1 ,..., 1;}. Using the fact that f{u, k + Ij) = fp{u, k) for all fc £ { 1 , ..., 1 } and 
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j G {0 ,m — 1}, we obtain 


t) cos(a0nO + i t) sin(a0nO = 


-1 i 


i=l 
' m— 1 




a4>riki 


j —0 k —1 


\k^l 


g(a</>nH)i 

,i=0 


Note that if a is not an integer multiple of m then a(j)nl = 27 r(a/m) is not an integer multiple 
of 2 tt, which implies 7^ 1 . In that case 


E^^ 


1 _ pa27ri 

(a0n/i)j ^ ^ ^ _ Q 


1 — e' 


aSnli 


□ 


Proof of Proposition!^ 

The proposition follows from Theorem [ 2 ] and Lemma [3 □ 

Proof of Proposition [3 

It is well known that a point x is a vertex of the set {x|Ax = 6, x > 0 } if and only if the 
system {Aj\xj > 0 }, where Aj is the j-th column of A, has full rank. 

The matrix A consists of ?; + (?; — l)fi + n rows, but they are linearly dependent. Let k be 
the affine dimension of {h{t)}t^‘x and, without the loss of generality, let T = { 1 ,..., n}. Then, 
the matrix [h{ 2 ) — h{l ),..., h{n) — h(l)] has rank k and thus its row space has dimension k. 
That is, without the loss of generality, we obtain that hi{t) — hi{l) = ihj{t) — hj(l)) 

for some ..., G R, for i > fc and t G { 1 ,..., n} (for t = 1 , we formally get 0 = 0 ). Let 
u G {!,..., u}. Then, if (ii) is satisfied in the first k coordinates of h, i.e., for hi,..., hj,, we 
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have for aW i > k and u G 


K K 

t j=i t j=i t 

k k 

= h^{l) - '^cl"'^hj{l) + '^cf'’w-^'^^{u,t)hj{t) 

j=i j=i t 

k k 


= w. 


t ■ ' 


-fcf,. 


i=i 


i=i 

k 

i=i 


where the second and the third equality hold because of (i). It follows that (ii) provides at 
most k(v — 1 ) additional linearly independent equalities. 

If ^ satisfies (i), it holds that ^^{u,t) = 1. Thus, if ^ satisfies (hi) for t = 1,..., n — 1, 
we have 1 = Ylt=i I]« + Su + I]« ?(■“> therefore (hi) holds also 

for t = n. That is, (iii) provides only n — 1 additional linearly independent equalities. Hence, 
the rank of A is at most v + (v—l)k + n—l and a vertex x contains at most u+(u — l)A: + n—1 
support points. 

□ 
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